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FINDINGS OF THIS STUDY INDICATE THE NECESSITY OF 
REEXAMINING THE UNMODIFIED USE OF SCHOLASTIC AFTITUDE LETTER 
GRADE DISTRIBUTION AS THE BASIS FOR THE ACHIEVEMENT LETTER 
GRADE DISTRIBUTION OF A CLASS. THE USE OF THIS SYSTEM IN 
BRITISH COLUMBIA IS BASED ON THE ASSUMPTION THAT FOR ANY SIZE 
SAMPLE OF STUDENTS, THERE WILL BE A HIGH CORRELATION OF I.Q. 
SCORES AND CLASSROOM ACHIEVEMENT GRADES. THE STUDY REVEALED, 
HOWEVER, THAT THE EXTENT OF DEVIATION BETWEEN THE TWO WAS 
CONSIDERABLE. THE ARGUMENT THAT THIS PRACTICE GROUPS THE 
ACHIEVEMENT LETTER GRADES AROUND A MID-FOINT APPROPRIATE FOR 
THE CLASS WAS FOUND TO BE INCORRECT FOR MANY CLASSES IN THE 
STUDY. IN BOTH THE HIGH AND LOW SCHOLASTIC APTITUDE GROUPS, 
ABOUT AS MANY CLASSES WERE FOUND TO UNDER-ACHIEVE AS TO 
OVER-ACHI EVE IN RELATION TO THEIR SCHOLASTIC APTITUDE MEAN. 
CERTAIN PROCEDURES MODIFYING THIS METHOD ARE RECOMMENDED FOR 
TEACHER USE IN GRADING ACHIEVEMENT. (HM) 
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A periodic and difficult task of the classroom 
teacher is the evaluation of the achievement of each 
of his pupils. Essentially, grading a student’s work 
may be accomplished in one or both of two ways. 
The first involves a comparison of the student’s 
achievement with an absolute standard. This results 
in a mark or score which may be expressed, for 
example, as 20 out of 25 words correct in spelling, 
or 80 percent in arithmetic. The second uses a com- 
parison of the student’s achievement with the achieve- 
ment of other students taking the same work. This 
can be expressed as a rank in the group, for example, 
14 out of 40, or as a percentile rank (out of 100), 
or as a letter grade which represents a range of 
several ranks. 

4 

The main difficulty in the use of an absolute stand- 
ard against which to judge the achievement of pupils 
lies in the establishment of that standard. This is 
usually done on the basis of the subjective judgement 
of the examiner, who "feels” that he has set a 
reasonable standard to be met. The students or 
other teachers may think otherwise and consider his 
examination to be very difficult, too easy or unfair. 
Very rarely do such arbitrary standards have any 
external validity other than that , *’hich resides in 
the past experience of the teacher with similar pupils 
taking a similar course. 

The problems inherent in establishing absolute 
standards of achievement by which to judge students, 
have led educators in British Columbia, to a wide 
acceptance of the second system, that of stating the 
rank of a student in his group. To avoid the 
impression of a greater accuracy than really exists 
in the allocation of school marks, several ranks are 
often placed together and a letter grade assigned to 
the groups of ranks. The system usedin British Columbia 
consists of seven letter grades, A, B, C+, C, C— , 
D and E, representing a 5, 20, 15, 20, 15, 20, 5 
percent distribution. Thus achievement considered 
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to be in the top 5 percent of the class, for example, 
the top 2 or 40 students, is designated "A” 
achievement. Achievement in the next 20 percent 
for example, from the third to the tenth place in a 
class of 40 students considered to be "B” 
achievement, that in next 15 percent "C+” 

achievement, and so on. 

While the method of rank ordering students and 
assigning letter grades to the ranks escapes the 
problems inherent in imposing arbitrary standards, 
it nevertheless creates other difficulties in British 
Columbia. This province has a highly centralized 
educational system which aims at uniformity of 
standards throughout. Examinations on a province- 
wide basis are sometimes set to ensure this 
uniformity. As a result, it is considered desirable 
that a given letter grade for achievement in one 
school district should represent a similar standard 
in all school districts. In other words, any "A” 
student or "D” student should be an "A” student 
or a "D” student in any school of the province, or 
in any of several classes of a given grade in a given 
school. 

It is apparent that if letter grades are intended to 
denote the same levels of achievement in any class 
in the province, the letter grade system previously 
described cannot be applied without modification. 
Obviously the top five percent of a homogenously 
grouped class of good students will differ in achieve- 
ment from the top five percent of a class of poor 
students. If a letter grade of A is given to the top 
five percent of the students in both groups, then 
the letter grades for each are in no way comparable. 

The teacher’s problem is now apparent. If he 
applies the ranking system rigidly, he has no assur- 
ance that his letter grades will represent comparable 
standards to those of other teachers in other classes. 
If on the other hand he attempts to assess the quality 
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of one student’s work apart from the achievement of 
other members of the class, he is forced into using 
the method of an absolute standard with the 

difficulties previously noted. 

In large schools with several classes in one grade, 
the problem is often met by giving the same 
examinations to all the classes of a given grade. 
It is assumed that because of the larger size of 
the group, the average level of distribution of achieve- 
ment will be comparable to the overall British 
Columbia pattern. Two criticisms of this practice 
may be offered. First, the assumption of average 
standards and normally distributed achievement in all 
large schools is incorrect, as some schools have 
demonstrably better achievement records than others. 
Second, if the classes in a single grade of a school 
have been grouped for educational reasons, a common 
examination tends to vitiate the purposes underlying 
the grouping and may be considered educationally 
undesirable. However, irrespective of the validity 
of these criticisms, the practice of grade wide 
examinations offers no help to the majority of 
teachers who teach in schools with no more than 
one or two classes in each grade. For the majority 
of classroom teachers there exists the problem of 
trying to determine how many of their students should 
justifiably receive a given letter grade from the 
results of classroom tests. 

In an attempt to assist teachers with this problem, 
Conway (1949) suggested that the distribution of I.Q. 
letter grades in a class be used as a basis for deter- 
mining the number of each of the letter grades for 
achievement. The rationale and procedure as stated 
by Conway follows: 

". . . . if we make two assumptions, (a) that the 
influence of the differences between teachers is 
not very great, and (b) that, on the average, groups 
of pupils whose ‘scholastic aptitude is high will 
have high achievement and groups whose intelligence 
is low will have low achievement, we can devise a 
method of determining the number of letter grades to 
assign to any school or class. It may be applied to 
any group irrespective of its size. 

The first step is to convert the pupil’s I.Q.’s or 
M.A.’s to letter grades. This may be done directly 
from the Tables of the B.C. Norms issued with 
certain scholastic aptitude tests, or it may be done 
with somewhat less accuracy from general percentile 
tables. 

If, when we have completed the conversion to letter 
grades we find that we have 3 A’s, 2 B’s, 3 C+’s, 



4 C’s, 6 C— ’s, 11 D’s and 3 E’s according to scho- 
lastic aptitude test results, we may safely assume 
that on the average we also may expect 3 A’s, 2 B’s, 
3 C+ ’s, etc. in achievement. Therefore when we 
have given an achievement test we have only to 
arrange the papers in descending order and to call 
the top 3 scores 'A’, the next 7 *B’, etc. This will 
give an appropriate number of letter grades to each 
class, irrespective of the difficulty of the examination 
they have written.” (Conway, 1949, p. 2) 

In the last few years Conway’s proposal has re- 
ceived wide acceptance from educational admin- 
istrators in British Columbia and the procedure 
recommended is currently in use in many schools. 

Smith (1961) attempted to evaluate the procedure 
recommended by Conway in an urban district of 
British Columbia. Smith designated this procedure 
as the "Standard” method and the awarding of letter 
grades A, B, C+, C, C— , D and E based on a 5, 20, 
15, 20, 15, 20, 5 percent distribution as the "Normal 
Curve” method. The use of the terms "Standard” 
method and "Normal Curve” method will be main- 
tained in this paper. 

Smith’s procedure was to compare two methods of 
assigning achievement grades as follows: 

1. 21 groups were established, each group con- 
sisting of all the grade seven students in a 
given school. 

2. Metropolitan Achievement Test results in 
Arithmetic and Language, and a scholastic 
aptitude measure from the Pintner General 
Ability Test were obtained for each student. 

3. For each of the 21 groups, achievement letter 
grades were assigned to each individual’s raw 
score on each of the Metropolitan Achievement 
sub-tests according to 

a. the "Normal Curve” method. 

b. the "Standard” method. 

4. The "Normal Curve” Achievement grades were 
obtained by taking each of the 21 groups in 
turn. The individual scores within a group were 
placed in rank order and the 5, 20, 15, 20, 15, 
20, 5 percent divisions made to give the number 
in each letter grade category. 

5. The "Standard” letter grades were obtained 
according to the procedure suggested by Conway. 
Local norms expressed in letter grades for the 
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Pintner General Ability Test were used to 
determine the number of each letter grade for 
scholastic aptitude in each group. The same 
number of each letter grade was then given 
to the achievement test results of the group. 

6. A criterion against which to evaluate these 
two methods of determining the letter grade 
was established by converting each achieve- 
ment test raw score into a letter grade by 
applying the "Normal Curve” system to the 
percentile norms issued by the Department of 
Education of the Province of British Columbia. 
Thus each achievement score was treated in 
three ways to convert it to a letter grade. 
First, by ranking the score in the student’s 
school group and applying the "Normal Curve” 
percentages. Second, by ranking the score in 
the school group and applying the number of 
each letter grade obtained from the scholastic 
aptitude test. Third, by ranking the score in 
the results of the whole province and applying 
the "Normal Curve” percentages to obtain 
the letter grade. The third method was con- 
sidered to give the best estimate of the quality 
of the achievement and was therefore used as 
the criterion. 

7. Finally the agreement of the letter grades in 
each group obtained by (a) the "Normal Curve” 
method (b) the "Standard” method with the 
letter grades obtained by use of Provincial 
norms was determined. 

8. "Agreement” was determined by tabulating the 
frequency with which the letter grade obtained 
by each of the two methods was the same as that 
obtained from the Provincial norms. 

Smith reports agreements of 78.8% & 72.3% for lan- 
guage and arithmetic respectively using the "Standard” 
method compared with 55.6% and 64.3% using the 
"Normal Curve” method and cautiously suggests 
that the "Standard” method may be "of value”. 

However, the groups used by Smith tended to be 
large. Of the 21 groups used in his study nine 
were over 100, and seven were nearly 300 or more. 
Only six had less than 45 students. If the data 
from these six groups are considered separately the 
percentage of agreement now becomes 44.7 and 50.0 
for language and arithmetic respectively using the 
"Standard” method, and 68.2 and 51.6 using the 
"Normal Curve” method, (extracted from Smith, 



p. 12b.) I n other words for these six classes only, 
the small difference was in favour of the "Normal 
Curve” method. 

Smith’s findings unfortunately contribute little to 
the problem of whether the "Standard” method sug- 
gested by Conway can be of value to a classroom 
teacher in determining grades for his class. Few 
groups of classroom size were used in his study, 
and these supported the "Normal Curve” method. 

As the value of the "Standard” method is still in 
question, the present study was undertaken to re- 
examine Conway’s proposal. Part I examines the 
original statement and the rationale which appears to 
underlie it, and illustrates the accuracy of prediction 
by regression estimation of intervals. Part II 
examines some of the empirical results of the applica- 
tion of the "Standard” method to a large number of 
grade six classes. 



PARTI 

This section attempts to examine the rationale 
underlying Conway’s proposal to use the scholarship 
aptitude letter grade distribution as a basis for the 
achievement letter grade distribution. 

Let us assume that there exists a population of 
1,000 students to which there have been administered 
an achievement test and a scholastic aptitude test. 
The "Normal Curve” percentages have been applied 
to both sets of results and letter grades awarded to 
the raw scores. It is readily apparent that, as the 
same fixed percentages have been applied to the 
1,000 children, a correspondingly identical number of 
each letter grade will have been given for achieve- 
ment and scholastic aptitude. In other words, 50 
students will have received an A grade for achieve- 
ment and 50 an A grade for scholastic aptitude, 
200 a B grade for achievement and 200 a B grade 
for scholastic aptitude, and so on. Some of the 
students who have received an A for scholastic 
aptitude would also have gained an A in achieve- 
ment, but others would not. Similarly, of the students 
who have received a B for scholastic aptitude, 
some might have obtained an A for achievement, 
others B, and still others C+ or lower. Whatever 
the extent of agreement between the grades for the 
two tests for individual children, the number of 
students in each letter grade category would remain 
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the same, because these numbers represent given 
percentages of the student population of 1,000 
being considered. 

Suppose a sample of forty children from the popu- 
lation of 1,000 is now taken and placed in one 
class in a school. Will the same relationship be- 
tween the numbers in each letter grade category on 
the two tests still obtain? Would one, knowing the 
numbers of each letter grade awarded on one test, 
be able to predict, even approximately, the letter 
grade distribution in the class for the other test? 
This is the essence of the problem. . 

Logically, it appears difficult to justify a proposal 
that, for example, because two students have A 
scholastic aptitude grades, then two other students 
in the class be given A grades for achievement. Any 
justification appears to be contained in Conway’s 
statement that "on the average, groups of pupils 
whose scholastic aptitude is high will have high 
achievement and groups whose intelligence is low 
will have low achievement.” (Conway, p. 2) , but 
how such a statement would apply to a class con- 
sisting of children of varying ability is not clear. 

Both Conway and Smith suggest that the letter 
grade distribution for scholastic aptitude as the basis 
for establishing the achievement grades may be 
modified to accommodate gaps in the distribution of 
marks, but the extent of the modifications in the 
examples given by them is very slight. (Conway, 
p. 4; Smith, p. 9). This do°s, however raise the 
question of- how much deviation from the scholastic 
aptitude letter-grade pattern might legitimately be 
made, and the basis for such a deviation. Assuming 
that the correlation between scholastic aptitude 
and achievement is of the order of .60 to .70, what 
degree of correspondence can be expected between 
the two distributions of letter grades if they are 
obtained independently? 

For individual students, the problem just posed 
could be approached theoretically through the use of 
regression estimation of intervals, which would 
determine the range of grades for achievement which 
could be predicted for a given grade of scholastic 
aptitude. For example, since a B letter grade 

represents a percentile rank of from 75 to 94, by 
using a rank order correlation of the magnitude 
found to exist between scholastic aptitude and 
academic achievement, an estimate can be made 
of the percentile interval within which the achieve- 
ment will fall 95 times in 100 for a 75 percentile 
scholastic aptitude score. Similarly, the percentile 



range for achievement can be calculated for scholas- 
tic aptitude at the 94 percentile. A combination of 
the achievement estimates for the 75 and 94 percen- 
tiles for scholastic aptitude will give the percentile 
range for achievement to be expected from a B 
letter grade for scholastic aptitude at the .05 level 
of confidence. 

In order to obtain for the regression estimates an 
approximate measure of the relationship existing 
between scc r es on a test of scholastic aptitude and 
scores on tests of academic achievement the following 
procedure was used. Five classes, totalling 180 
students, were selected randomly from the 103 classes 
used in Part II of this study. Correlations were 
calculated between the percentile rank obtained from 
the Otis Self- Administering Test of Mental Ability 
and the percentile rank obtained from standardized 
tests of reading, language, arithmetic computation 
and science. The mean correlation of .66, obtained 
by using Fisher’s Z transformation, was considered 
to be a reasonable approximation, to use for dis- 
cussion purposes, of the relationship between a 
scholastic aptitude and academic achievement. 

However, as a correlation coefficient of the order 
of .66 indicates a large amount of scatter around the 
regression line, it is to be expected that the over- 
lapping of the predictions for achievement from 
different letter grades for scholastic aptitude will 
be very considerable. Table I shows the achievement 
interval estimates for different grades of scholastic 
aptitude at the .05 lev. - ;! of confidence, based on 
a correlation of .66 between scholastic aptitude and 
achievement. 

A reading of Table 8 shows that theoretically, 
on the basis of a correlation coefficient of .66, a 
considerable range of grades for achievement should 
be anticipated for a given scholastic aptitude grade. 
For example, from Table I it can be seen that for a 
scholastic aptitude letter grade of C+, which re- 
presents a percentile range of 74 to 60, the 
estimate is that 95 times in 100 the achievement 
letter grade will be between the percentiles 86 and 
36, to give a range of letter grades from B to C— . It 
should be noted that five times in every hundred, 
or twice in a class of forty children, the achievement 
may be expected to fall outside the estimate. 

The wide range of letter grades for achievement 
obtained by regression estimation for a given 
scholastic-aptitude letter grade clearly indicates 
the folly of attempting to predict an individual’s 
achievement grades on the basis of his scholastic 



aptitude standing alone. It also seems to raise 
serious doubts concerning the wisdom of using the 
scholastic aptitude letter-grade distribution as the 
basis for the achievement letter-grade distribution. 
For the two distributions to correspond precisely, 
every deviation in one direction from complete 
agreement of the scholastic- aptitude and achievement- 
letter grades would require compensatory deviation 
in the other direction. In other words, all examples 
of "over- achievement” by students on the basis of 
the scholastic aptitude letter grade would require 
a similar amount of "under-achievement” by other 
students to occur to compensate for it. x urthermore 
such "over- achievement” and "under- achievement” 
would have to occur with certain students only, 
in order to maintain the balance within the system. 
For example, if two students with *A* for scholastic 
aptitude 'under-achieve* and obtain *C+* achievement 
grades, then to maintain the balance two students 
with *C+* scholastic aptitude must achieve 'A’, 
or two students with 'B* scholastic aptitude achieve 
'A* and two students with 'C+* scholastic aptitude 
achieve 'B*. The extent to which such adjustments 
would occur in a group of, say, forty children in 
one class cannot be estimated, although a priori, 
such compensation would appear to be a somewhat 
unlikely occurrence, especially as many of the 
relevant variables such as the motivation of the 
children, reading ability, and the effectiveness of 
previous teaching are likely to be non-randomly 
distributed from one class to another. 

On purely logical grounds there appears to be 
little support for the proposals of Conway and Smith 
to use the scholastic aptitude letter grade 
distribution as the basis for the achievement letter 
grade distribution. Any merit in their proposal 



would appear to reside in the extent to which the 
deviations from complete agreement between the 
scholastic aptitude and achievement letter grades for 
individuals would average out within a typical 
class. This cannot be ascertained theoretically. 
Fortunately, however, there remains a simple 
empirical approach to this problem, that of examining 
school records to see- what actual 1 v occurs in Prac- 
tice. This was done in Part II of this study. 



PART II 

Conway’s proposal is based on the assumption 
that a close correspondence exists between the 
distribution of letter grades for scholastic aptitude 
and the distribution of letter grades for achievement 
in a class. This part of the study is concerned 
with an examination of the correctness of this 
assumption through a scrutiny of school records 
to determine empirically the extent of the discre- 
pancies between the distributions. 

There are two requirements for such an approach 
to the problem of the validity of using the "Standard” 
method. First, marks for achievement must not be 
contaminated by teachers* expectations of per- 
formance. In other words, the achievement test 
must either be objective or scored without knowledge 
of the scholastic aptitude grades. Second, the letter 
grades derived from the scores on the tests of 
scholastic aptitude and achievement must be strictly 
comparable, that is, they must be based on norms 
derived from the same population. Both these re- 



TABLE 1 

Predicted Grade Intervals for Achievement for 
Different Scholastic Aptitude Letter Grades (r* .66) 



Letter Grade 


Scholastic Aptitude 

Range in Percentiles 


Estimate for Achievement 
Range in Percentiles 


Letter Grade 


A 


95 or over 


99-59 


A to C 


B 


94-75 


99-46 


A to C 


C+ 


74 - 60 


86-36 


B to C- 


C 


59-40 


76-23 


B to D 


C - 


39-25 


63-12 


C+to D 


D 


24-5 


53-1 


C to E 


» E 


4 and under 


40-1 


C to E 
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quirements were met in this study by using the 
results from a battery of standardized tests with 
norms developed from the test scores of all the 
children of a given grade in a given area. 

The purpose of this part of the study, then, was 
to examine the empirical support for using the 
scholastic aptitude letter grade distribution as a 
basis for determining achievement grades. This 
was done by examining the extent of the ui.ference 
between the letter grade distribution for scholastic 
aptitude and the letter grade distribution obtained 
in each of six academic subjects, for each of a 
large number of classes in one school district. The 
letter grades in each case were obtained from district 
norms based on the results of standardized tests of 
scholastic aptitude and academic achievement. 

It is sometimes suggested that, while the scho- 
lastic aptitude letter grade distribution may not 
coincide exactly with the distribution of achievement 
letter grades, it at least groups the grades around a 
mid-point which is appropriate for the class. In 
other words, it is hypothesized that the mean of the 
scholastic ability letter grades and the mean of the 
achievement letter grades for a class will correspond 
closely, even though the two distributions of letter 
grades may be somewhat different. This hypothesis 
also was examined. 



PROCEDURE 

The procedure used in this study is first outlined 
below and then discussed. 

In one class, scores from a test of scholastic 
aptitude and from tests of achievement in six aca- 
demic subjects were converted to letter grades using 
norms established for the school district. 

For this class the distribution of the letter grades 
for scholastic aptitude was compared with the dis- 
tribution of the letter grades for achievement tn one 
of the academic subjects. 

The difference between the number of "A” letter 
grades obtained by the class on the test of scho- 
lastic aptitude and the number of "A” letter grades 
obtained on the test of achievement was calculated. 
This was repeated for each letter grade. 

This procedure was repeated in the class for a 
total of six tests of achievement. 



The whole procedure was repeated for a total of 
103 classes, and the results tabulated. 

The mean value of the letter grades for scholastic 
aptitude was also calculated for each class. This 
mean value was compared with the mean value of 
the letter grades for a test of achievement. 

Differences between the mean values were cal- 
culated and tabulated. This procedure was repeated 
for a total of four tests of academic achievement. 

The above outline delineates the essence of the 
procedure used. However, certain assumptions and 
checks were made during the study. A description 
of the procedure therefore now follows in detail. 

The records of all the grade six classes in School 
District 61 in British Columbia for the years 1961-62 
and 1962-63 were analysed. Grade six was chosen 
because of its middle position in the school grades 
and because complete results of a test of scholastic 
aptitude and standardized tests of achievement in 
language, spelling, reading, arithmetic reasoning 
arithmetic computation and science were available. ^ 
Tably 2 shows the size ...id number of the classes in 
grade six for the two years combined. 



TABLE 2 

Size of the 103 Grade Six Classes Used 



Class Size 

43- 44 
41 - 42 
39 - 40 
37 - 38 
35 - 36 
33 - 34 
31 - 32 
29 - 30 
21 - 28 
14-20 



Frequency 

3 

5 

15 

12 

13 

18 

11 

7 

7 

12 



N - 103 



The scores for the test of scholastic aptitude and 
for each of the tests of achievement were translated 
into the letter grades, A, B, C+, C, C-, D or E 
using the district norms. A comparison between the 
letter grade distribution for scholastic aptitude 
and the letter grade distribution for a test of achieve- 

1. See Appendix A. 
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